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ABSTRACT. Semantic and logical problems arising in an incomplete information database are investigated. 
A simple query language is described, and its semantics, which refers the queries to the information about 
reality contained in a database, rather than to reality itself, is defined. This approach, called the internal 
interpretation, is shown to lead in a natural way to the notions of a topological Boolean algebra and a 
modal logic related to S4 in the same way as referring queries directly to reality (external interpretation) 
leads to Boolean algebras and classical logic. An axiom system is given for equivalent (with respect to the 
internal interpretation) transformation of queries, which is then exploited as a basic tool in a method for 
computing the internal interpretation for a broad class of queries. An interesting special case of the 
problem of determining the internal interpretation amounts to deciding whether an assertion about reality 
(a “yes-no” query) is consistent with the incomplete information about reality contained in a database. A 
solution to this problem, which relies on the classical combinatorial problem of distinct representatives of 
subsets, is given. 
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1. Introduction 


For various reasons, the information contained in a real-world database is usually 
incomplete. This creates a need for developing methods to handle situations where 
a database does not contain all the information a user would like to know. 

This paper follows a previous paper of the author [13], where a simple mathematical 
model of a database with incomplete information was introduced. This model, called 
an information system (or just system), is based on attributes which can take values in 
specified attribute domains. Information incompleteness means that instead of having 
a single value of an attribute, we have a subset of the attribute domain, which 
represents our knowledge that the actual value is one of the values in this subset, 
though we do not know which one. This extends the idea of Codd’s aull value [2], 
corresponding to the case where this subset is the whole attribute domain. A simple 
query language to communicate with an information system was also described in 
[13]. This language includes two kinds of queries, terms and formulas (“yes-no” 
queries). The expected response to a term is a list of objects with the property 
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expressed by the term, while the response to a formula is a truth value, T (truth) or 
F (falsity). It was shown in [13] that when the information is incomplete the same 
query can be interpreted in many different ways, and to understand those differences, 
two basic interpretations of a query were introduced, the external one and the internal 
one. The external interpretation refers the queries directly to the real world modeled 
in an incomplete way by the system, so that the external interpretation of a term 7 is 
the set of objects which in reality have property :. Of course, the information 
contained in the system is, in general, not sufficient to determine this set exactly. 
However, in [13] we give methods for computing the best possible bounds on the 
external interpretation of ¢ logically derivable from the system, that is, 


(i) litl, the set of objects for which we can conclude, from the information 
available in the system, that they are in the external interpretation of ż, and 

(ii) ||7|]*, the set of objects for which we cannot rule out the possibility that they 
belong to the external interpretation of £. 


In contrast to the external interpretation, the internal one refers the queries to the 
system's information about the real world, rather than to the world itself, so that the 
internal interpretation of a term ¢ is the set of objects for which the information 
contained in the system satisfies the conditions expressed by t. 

Although in the present paper we deal almost exclusively with internal interpre- 
tation (so that the word “internal” will sometimes be omitted), the results obtained 
provide solutions to some problems concerning the external interpretation which 
were left open in [13]. This paper is organized as follows. In Section 2 we give basic 
definitions and survey those results of [13] which we shall need here. In Section 3 we 
precisely define the internal interpretation of queries and investigate its basic prop- 
erties. In Section 4 we give an axiom system for equivalent (with respect to the 
internal interpretation ) transformation of terms. The technique of equivalent trans- 
formation of terms is then used as a basic tool in an algorithm for computing the 
internal interpretation of an arbitrary term in an arbitrary system. We also prove 
that our axiom system is complete in the usual logical sense, and we explain the 
relation of the notion of a topological Boolean algebra to our semantics of terms. In 
Section 5 we consider a sublanguage which seems to be interesting from the practical 
point of view. Determining the interpretation of a query in this sublanguage is much 
easier than in the general case. In particular, we are able to find the interpretation of 
any formula (which we were not able to do for the general language). Our method 
of computing the interpretation of an arbitrary formula in the sublanguage, which is 
described in Section 6, has a combinatorial flavor and is related to the classical 
problem of distinct representatives of subsets [4]. In Section 7 we discuss some 
alternative approaches to defining the semantics of queries. 


2. Basic Notions 


In this section we give some basic definitions which we shall need in the rest of the 
paper. Some of them coincide with those in [13], to which the reader is referred for 
more detail and motivation. 

By an information system (or a system for short) we mean a triple 


f= (X, (Dien U), 
where 


(i) X is a finite set of objects, 
(ii) J is a finite set of attributes, 
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(iii) D is a nonempty set called the domain of attribute i, 
(iv) U is a function which associates with every attribute / and every a E D; a set 
U(i, a) G X, such that for every i € J, 


U{U(i, a):a E Di} = X. (1) 


Intentionally, U(i, a) is the set of objects for which attribute i possibly takes value a. 
According to this interpretation we can determine for every x E X and every 
i © I the set 


Bix) = {a © D;:x E€ U(i, ay} (2) 


of all possible values attribute i can take for object x. Conversely, U can be obtained 
from functions £;, i E J, by the formula 


U(i, a) = {x E X:a € Bi(x)}. (3) 


We always assume that the set X of objects, the set J of attributes, and the attribute 
domains Dj, i E J, are fixed, and we often represent a system by functions ĝa i € J, 
rather than by U. 

Notice that a system represented by £; i E J, may be treated as a relational model 
[I] with only one relation. However, in our case this relation consists of tuples 
(di, ..., An), where each A; is a subset of D; rather than an element of D; 
(Ai = Bix). 

For two systems 4 = (X, (Dien Ui) and % = (X, (Dien U2), we say that A is 
an extension of A (in symbols, A < Aor A= A) if 


U2(i,a) C Ui(i,a) forall iG 7 and a€ Di, (4) 
or equivalently, 
Bi(x)C B(x) forall ¿EJ and xEX, (5) 


where 8; and $; i E I, correspond to A and &, respectively. 

Intuitively, A < S means that the knowledge represented in & is contained in the 
knowledge represented in “%. Of course, < is a partial order. A system is called 
complete if U(i, a) N Uli, b) = Ø for all a, b E Dj, a # b, or, equivalently, if Bi(x) 
consists of a single value for all i € J, x E X. For a theory of complete systems see 
[14]. A complete extension is called a completion. 

Our query language consists of terms and formulas. Terms are built up from certain 
elementary parts called descriptors and from constants 0, 1 by means of symbols for 
Boolean operations —, +, -, —, and a special unary operation C. Every descriptor is 
of the form (i, A), where i E J and A G Di, more exactly, A is in a fixed Boolean 
algebra 4; of subsets of D,. Descriptors will be informally written as (LENGTH = 


50), (SEX = M), (SAL < 10000), (COLOR = RED) or simply red, etc. An example 
of a term is 


—((DEPT# = 2)- (NAME # SMITH) + (SAL < 50000) - (AGE < 30)). 


The set of terms is denoted by 7 

Formulas are built up from atomic formulas t = s, where t, s € Z and from logical 
constants T, F by means of logical connectives 7, V, A, =>, and a special (modal) 
unary connective (J. Finite disjunctions and conjunctions are abbreviated as 
Wies P; and Mjes Bj, respectively; 7(¢ = s) is denoted by ¢ # s. An example of an 
(atomic) formula is ((SEX = F) .—(SAL > 10000)) = 0. The set of all formulas is 
denoted by Æ Intuitively, both © and O mean “in every possible extension of our 
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present knowledge.” A query (term or formula) is simple if it contains neither 0) 
nor O. , 

The interpretation of a query Q in a complete system & called the value of Q in 
Z and denoted by || Q || or just || Q||, is defined in the natural way. We put 


Ii, Ay] = U, UG, a) = {x € X:Bix) C A), (6) 


and we interpret 0, 1, —, +, -, — as Ø, X, and the set-theoretical operations of 
complementation, union, intersection, and the operation “(X\A) U B,” respectively; 
for formulas we define ||: = s = T iff i: = isl, and we interpret the logical 
connectives in the natural way. Symbols [J and CO are simply disregarded (there can 
be no proper extension of a complete system). Two queries Q, and Qz are externally 
equivalent (in symbols, Qı Ze Q2) if || Qi |= || Qe||. for every complete system Z 
(here D;, i E J, are fixed, but X and U are arbitrary). In [13] we gave an axiom 
system B for externally equivalent transformation of queries, consisting of the 
following axiom groups: 


B1. Substitutions of terms into the axioms of Boolean algebra, and the axioms of 
equality (see, e.g., [15]). 
B2. The following axioms concerning descriptors: 


(i) (i, D) = 0, (i, Di) = 1, 
(ii) (4A) + (i B) = (i AU B), 
(iv) —(i, A) = (i, Di\A), 


for all i € J and all A, BE Z: 
B3. Substitutions of formulas into the propositional calculus axioms. 
B4. The axioms [Jt = ¢ for every t E Zand O® = © for every BEF 


(In fact, B4 is missing in [13], where we consider only simple queries.) It can easily 
be proved that the axiom system B is complete, that is, Qı Ze Q2 iff Qi can be 
transformed into Q- by using axioms in B. Our main task in this paper is to generalize 
the definition of the value of a query to arbitrary (not necessarily simple) queries and 
to arbitrary (not necessarily complete) systems, and then to axiomatize this extended 
notion of value in the same way as the value of queries in complete systems is 
axiomatized by the axiom system B. We begin this program by giving, in the next 
section, a precise definition of the value || Q ||- of an arbitrary query Q in an arbitrary 
system S 


3. Internal Interpretation of Queries 


Before introducing the formal definition of || Q|| for arbitrary Q and & let us first 
give some intuitive ideas connected with it. Our definition is based on a consistent 
approach where any query Q is interpreted as expressing an internal (with respect to 
the database) property of objects (if Q is a term) or a property of the database as a 
whole (if Q is a formula); in other words, a term or formula expresses some conditions 
on the information available about an object or about the whole collection of objects, 
respectively. Any descriptor (i, A) will be understood as “known to have the value 
of attribute į in A,” and the symbols —, +, and - will be interpreted as the usual set- 
theoretical operations of complementation, union, and intersection, respectively, in 
exactly the same way as in the case of complete information. Notice that, in particular, 
the interpretation of red + blue is “known to be red or known to be blue” (rather 
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than “known to be [red or blue]”); similarly, —red is interpreted as “not known to be 
red” (rather than “known not to be red”). 

The interpretation of Er will be, roughly speaking, the set of all objects not only 
having (internal) property t now, but also in every—not necessarily complete—conceiv- 
able extension of our present knowledge. The interpretation of O® is similar—it is T 
if and only if the assertion concerning the information on our collection of objects, 
expressed by ®, is bound to remain true in every possible extension of our present 
knowledge. It should be emphasized that our interpretation of queries is intended for 
a user who is fully aware of the fact that the information available in the system may 
be incomplete, and who may explicitly refer to this incompleteness in his queries (by 
using O and O). 


Definition 3.1. Let Z= (X, (Dier U) be an arbitrary query. The value of Q 
in Z denoted by || Qll (or || Q|| when ¥ is understood), is defined inductively as 
follows: 


O WADI = (© XB) CA}; 
Gi) [o= l=; 

(ii) -= XN; 

Gv) e+ f= [a U Isl 

() [e-sl= MLO Usk 

(vi) le> sl = ANI) U Isl 

(Vil) [Ers (x E X:for every 77 = 2 x € ile) = A tes 


(viii) IFI=F,_ITI= T; 
œw wast={7 if el = lsh, 


otherwise; 
œ) ISl = e |; 
(xi) |]®V ¥I =O] vv; 
(xii) [DAY] =O] Ale]; 
(xiii) |= +| =] v | eI; 


ID] = T if [ls =T forevery Z >Z 
| 7 (2 otherwise 


inf {|| dle P = SY}. 


(xiv) 


(inf refers to the natural ordering F < T.) 


It will be convenient to denote -0-—¢ by ©1, for any term 1, and -107@ by O®, for 


any formula Ọ. ©¢ and OỌ have a natural interpretation, given by the following 
theorem. 


THEOREM 3.1 
(a) For any term t and an iy system E 


Orly = {x EX: for some fH >SFxE lelle} = yN lello 
‘=F 


(b) For any formula ® and any system F 


T ifforsome PZF ||P. = 7, 
Oo], á 
F otherwise 
sup{|| le F > A. 


I 
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PROOF 
(a) Hers = -E-l =X NA [tlle = XV XN elle) 
PEF TEF 
= X\K\ UY Melle) = U elle 
LES Mes 
= {x € X: for some F = Z x E ftlo}. 
(b) OO, = [078 A |}, 


inf {|| je P = S} = A7sup {Olle :. SH’ = F} 
sup{|| Olle: Z = A 
= Y if for some =F (ðe = T, 


F otherwise. o 


It should be emphasized that in Theorem 3.1 as well as in Definition 3.1 (vii) and 
(xiv), Z’ is not assumed to be complete. If an operation CY were defined to be like 
E, except that “for every S’ = S” is replaced by “for every complete S’ > FY,” then 
CY and CJ would not coincide. We have, for instance, ||O((SEX = M) + (SEX = 
F))||+ = X, since in every completion S’ of F || (SEX = M) + (SEX = F) ||, = 
| 1 |» = X. On the other hand, if the value of SEX for an object x is not known in 
S, then 


x @ || (SEX = M} + (SEX = F) |v = || (SEX = M) |v U]] (SEX = F} lv, 


and consequently x ¢ ||E((SEX = M) + (SEX = F))]|+ (notice that one of the 
extensions of Z is Y itself). We see in the next theorem that LY can be expressed 
by EQ. 


THEOREM 3.2. For any system ¥, any term t, and any formula ®, 


(a) HEt = {x E X: for every completion F of F, x € \\t\lv}; 
(b) ||OEIt||>= {x E X: for some completion F’ of F x € |\tl| +3; 


_ |Z ifforevery completion ¥' of P, le =T, 
ene G otherwise; 


_$T  ifthereisa completion S' of f with ||Olly =T, 
(d) joUels= p otherwise. 


Proor. The theorem follows from the structure of the partial order <, more 
specifically, from the fact that for any Z there is a maximal element (i.e., a complete 
system) S’ = SF Let a(F) be an arbitrary assertion with variable S ranging over 
systems, which for any particular Y may be either true or false. Then 


GQ WH = SAL = SMF") 
is equivalent to l 
(ii) for every complete S’ > F AF’). 
Indeed, (i) implies 
(for every complete Y’ > SPIS” = F')a(H”), 


which is equivalent to (ii), since for complete S’ the only S” = F’ is F itself. 
Conversely, assume that (ii) holds. Then by the structure of <, for every Y > S 
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there is at least one complete S” > S’. But S” = S; hence by (ii), a(Y”) is true, 
which means that (i) holds. Taking a(/) to be x € ||f||.4 we obtain 


x€|HOrll- oe WS’ = Px E Orly 
o VP = NAF = Px E lille 
= for every complete S’ > F x €E |e, 


which proves (a). Similarly, taking a(S) to be ||® || -we have 


[Dodje e WAH = ANAF = PYNE 
© for every complete S’ > SZ, || Ol, 


which proves (c). 
To prove (b), we have 


e0 = || -G-Br|y = | -G-O-—1]| y= -08-2 XDS- s. 
Hence, by (a), 
x E lSBDrl| e x g ||AO-7|- 


<= (for every complete Y’ > Z, x E ||-t|-) 
<= for some complete S’ > S, x E |[t|]-. 


Similarly, || OO® || = “|Om ||, and we obtain (d) from (c). O 


Two queries Qi, Q2 are said to be internally equivalent (in symbols, Qı =; Qə) if 
10: - = || Qll v for every system ¥ (as in the definition of external equivalence, Dj, 
+€ J, are fixed, but X and U are arbitrary). It follows trivially from the definition 
that the internal equivalence is stronger than external, that is, Qı =; Qz implies 
Qı =. Q:. Of course, the converse implication does not hold. For instance, (i, A) + 
u. B) =. (i, A U B), but in general (if A # A U B # B), (i, A) + (i, B) Fi 
u. AUA). Indeed, 


(i, A) + (i, B)|| = || (i, A U B} || = {x © X: B(x) C A U B), 


and a subset of A U B need not be a subset of either A or B. Another example is the 
equality —(i, A) = (i, D;\A), which is of course true under the external equivalence 
and is not true (except for trivial cases) under the internal equivalence: 


|i, A) |] = X\| i, A) | = X\ {x © X: B(x) C A} 
= {x E X: B(x) N (DAA) ¥ Ø}, 
while 


<i, DA ) |] = {x E X: B(x) c DAA}. 


When we put these examples into more concrete terms, (SEX = M) + (SEX = F) 
s interpreted as the set of persons whose sex is known, while (SEX, {M, F}) is 
interpreted as the whole set X of persons; similarly, — (SEX = M) is interpreted as 
the set of persons who are not known to be men, while (SEX # M) as the set—in 
general smaller—of persons known not to be men (i.e., known to be women). 


4 Axioms for Internal Interpretation of Terms 
So far we know exac 
3.1), but we do not k 
evaluating the valu 


tly what the internal interpretation of a query is (see Definition 
now how to compute it. In this section we develop a method for 
e of an arbitrary term in an arbitrary system. (Formulas are 
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treated in the next section, where we give simple algorithms for determining the 
internal interpretation for queries of a special type.) 

Our method of computing |{t||- is based on transforming f into some equivalent 
term for which determining || - | is easy. The transformation process is based on a 
set of axioms which completely axiomatize the internal equivalence, in the same 
sense as the axiom system B completely axiomatizes the external equivalence. 

In order to develop our axiom system, we need some facts about topological 
Boolean algebras. 


Definition 4.1. A topological Boolean algebra (TBA for short) is an algebra 
(B, Fam; I, 0, 1) 


such that (B, +, -, —, —, 0, 1) is a Boolean algebra and II is a unary operation with 
the following properties: 


(i) U(a-b) = la. ib, 
(ii) Masa, l 
Gii) Wa = Ia, 
(iv) Wi=1, 


for all a, b E B (a = b abbreviates a-b = a). 

An example of a TBA is the Boolean algebra of subsets of a topological space with 
the operation of taking interior as I. A thorough study of TBAs, as well as the 
explanation of the elementary topological notions used here, can be found in Rasiowa 
and Sikorski [15]. The next lemma gives an example of a TBA which plays an 
important role in our considerations. 


LeMMA 4.1. Let ( 2%, =} be a partially ordered set. For all A, BC Z, let 


WA = {xE &:for every y= x, yE A}, (7) 
A= B=(2\A)UB, 
-A = N\A. 


Then 
(PAT) U, N, =, —, I, 2, #) 
(where A(X) denotes the set of all subsets of X) is a TBA. 


Proor. It is sufficient to show that the operation I defined by (7) satisfies 
conditions (i)-(iv) of Definition 4.1. 
(i) WAN B)= {x€ F: (Yy>zx)yE ANB} 
= {x€ T:(Yy>=x)y ZA} N {xE Z:(Yy>x)yE€ B} 
= 4 N UB. 
(iii) IIA = {x E Z: (Yy > x) (Yz = y)z € A} 
= {xE £2:0Vz2=x)zE AJ =A. 
(ii) and (iv) are left to the reader. O 
Notice that the example provided by this lemma is a special case of the afore- 
mentioned general example based on a topological space. Indeed, (7) defines a 
topological interior operation and hence endows 2% with the structure of a topological 
space. It may be noted that the subsets 
{yE Ly=x}, xE 


form a basis of this topological space. 
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I behaves as a topological interior operation, and, similarly, the operation C, 
defined by 


Ca = —I—a, 
can easily be shown to have the properties of a topological closure operation: 
Cia + b) = Ca + Cb, 


CCa = Ca, 
a = Ca, 
co =0. 


The role of TBAs in the internal interpretation of terms is analogous to that of 
Boolean algebras in the external interpretation. The analogy is that we can perform 
internally equivalent transformations of terms using the axioms of TBA, more 
exactly, the axioms listed below. 


Axioms for Terms Under Internal Interpretation. The set TB of axioms consists of 


TB1. Substitutions of terms into the axioms of TBA, that is, axioms of Boolean 
algebra, and 


O O-s) = Gt-Os, 
qi) tE = Orr, 
(ii) OG = Gy, 
(iv) Gl=1. 
TB2. The following axioms concerning descriptors: 


(v) (i,9) = 9, 
(vi) (i, Di) = 1, 
(vii) (i, A). (i, B) = (i AN B), 


forall i E I, A, B E€ &. 
TB3. The axiom 


k Np k np 
vid OY (-« Ap) + D (ip, Bn) = X > (ip, (Dip\Ap) U BZ), 
p= q=l 


p=! q=} 
for every positive integer k, every sequence of positive integers M, ..., me, every 
sequence of distinct attributes i, ..., iz, and all Ap, Bp, ...,ByPE B,, lSpsk. 


The last axiom is fairly complicated, but it is hoped that its role will become clear 
later, when we define the weak multiplicative normal form (see Definition 4.2 below). 

Before proving that the axiom set TB properly axiomatizes the internal interpre- 
tation of terms, we shall give an intuitive explanation why the notion of a TBA is 
relevant in the context of internal interpretation, more exactly, why transforming 
terms according to the axioms of TBA preserves internal equivalence. To this end we 
consider the partially ordered set ( Z, <), where Z is the set of all systems with fixed 
X and (Di)icr. This partially ordered set defines a TBA 


Á= (A(X), U, an, =>, =, I, ©, X) (8) 
(see Lemma 4.1). For any x € Xand tE J let 
St) = {LE Lx E ltl}. (9) 


It will be proved (see Theorem 4.1) that fx preserves all TBA operations, that is, 
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FAG) = D(A, f-t + s) = f(t) U f(s), etc. This means that, loosely speaking, the 
symbols LJ, +, . . . can be interpreted as operations in some TBA. Using this fact, one 
can easily prove that if ¢ = s can be derived from the axioms of TBA, then f(t) = 
f(s) for every x € X, and consequently || || = || s||,-for every S, that is, t =, s. 

Let us formulate and prove this more precisely. We write ¢ =7p s if ¢ can be 
transformed into s by using the axioms in TB. Obviously, ~rg is an equivalence 
relation on the set Z. 


THEOREM 4.1 (ADEQUACY OF THE AXIOM SYSTEM TB). For any terms t, $, 
tXrps implies t Zi s. 


Proor. It is sufficient to prove that if t = s is an axiom in TB, then t =; s. To this 
end, let us consider the partially ordered set (2; <) of all systems with fixed X and 
(Di)ics, ordered by the relation of extension. The set A(X) of all subsets of Z, 
together with the usual set-theoretical operations and the operation II defined for 
every AC F by 


WA = {FE £:forevery Z =F, Sf EA}, 


defines a TBA æ (see (8)). For any x € X, let the mapping 
fer FT > AX) 
be defined by (9). The mapping fx has the following properties: 


full + 8) = felt) U fl), 
SAES) = felt) Af), 
fat > 8) = ft) = fal), 
ft) = fill), 
O= 
fal) = Z, 
FAD) = fei). 


We prove the first and the last property. The others are left to the reader. 


ft + s)= {LSE Ux E |t +t sly 
{FE Bx E|ltlly Vx E lisle 
{FE Tx E |e] U {FE Fx Ells] a 
= felt) U fr(s). 
JON = {FE Xx E [Orl 
= {LE NF > F) x E lille} 
= {LE RWS = P) P EfAt)} 
= If). 


Let ¢ = s be an axiom of group TBI, that is, a substitution of terms into an axiom 
of TBA. Since this axiom holds true in A, we infer from the properties of fx that 
S(t) = f(s) for every x E X. We illustrate this for axioms t-(s + r) = t.s + t-r and 
E(t- s) = e-s: 


Fls + P) = fD N E) UO) 
=D NALO U (felt) Nf) = filles + t, 
FES) = WO fal) = O N Wf) = f(t Ls), 


| 
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Now we prove the same for axioms of group TB2. As (v) and (vi) are trivial, we 
restrict ourselves to (vii): 


SAGA) (i, B)) = {PE Fx E |] (i, A). (i BY LD 
= {FE Ex El, A) oN Ii BYA 
= {SE F: B(x) C A A BAx) C B} 
{SE X: B(x) CAMB} 
{PE B:x El (ZA NB) = fi, AN BY). 


ou 


(Here $; i E J, denote the functions uniquely corresponding to S.) 
The last axiom to consider is (viii). This is the most difficult part of the theorem. 


k np 
fF [-(ip Ap) + F (ip BE) 
p=1 q~i 
k np 
= {ve BMS! = S)x ev, [1-6 Ay) lle U y, || (ip, BF) I|} 


= {FE LNP = NEP E || ip Ap) le V gx E || lip Be) Ile ]}. 


Taking into account that 


x E l| (ip, Ap) Ilo (Ap C B), 
x E || (ip Bp) lly BŒ) C BP, 


(Bi, i € I, correspond to S), and using (5), we obtain 


k Np 
SE AGP |- Ap) + X (ip, sn |) 
p= q= 
= (for all nonempty Zı G a(x), ..., Zk C Bi,(x)) 
(Ap) l~(Z, E Ap) V (Ag) Zp C By] 
= (Sp) (for every nonempty Z C fi,(x))[Z C Ap = (3q) Z C Bp) 
«> (ap) (for every nonempty Z G B(x) N 4p) Gq) Z C BP 
= (Sp) (3q) Bi,(x) N Ap G BP 
e (3p) (3q) Bi,(x) E (Di,\Ap) U BY 
k Np, 
exe ei = Il (i, (Di,\Ap) U BY) lls 
k Mp 
= SE fx ( E È (ip (Di,\A) YU B9). 
p=l g=1 : 
To sum up, we have proved that for every axiom t = s in TB and for every x E X, 
F(t) = f(s). But this means that for every such axiom, t =; s. Indeed, 
tlle = {x E X: SE f.(t)} = {x E X: SE fA) = lisli 


Now the theorem follows from the fact that each time we use an axiom t = s in the 
transformation process, we replace an occurrence of t by s in some term r. Since 
t =, s, this operation does not change the value of r. O 


Putting our proof into more algebraic terms, we may illustrate it by the following 
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commutative diagram: 


glilis P) 
S p 

I] x 

xEX 


where [[xex Xx denotes the direct product of TBAs .;, each £. being a copy of 
A= (P(X), U, N, =>, —, I, Ø, X), and where f = (fz)xe x and 


(a) = {x E X:FE az} 


for every a = (ax)zex © []xex &%. The theorem follows from the fact that fisa 
homomorphism (i.e., it preserves all TBA operations) and that [xe x %, is a TBA. 
We prove in this section that the converse of Theorem 4.1 holds as well, which 
means that the relations =; and =rg coincide (see Theorem 4.4 below). 
Using Theorem 4.1, we can obtain some useful corollaries from the axioms in TB. 


LEMMA 4.2 


(a) OG, A) =: (i, A); 
(b) (i, A) zi (i, DNA); 
(c) Eli, Ai) + +++ + (ik, And) Zi (i, A1) + ++ + Cir An) 
for arbitrary (not necessarily distinct) i, ..., ik; 
(d) EX-(i, Ai) + +++ + — (i, Ar)) Zi (i, DAA: N +++ N Aa)); 
(e) Ei, Ai) + +++ + — (in Ak)) Zi (i, Di\A1) + +e + Cin, D; MAr) 
whenever iņ„..., ix are pairwise distinct; 
(f) Oli A) = —(i, Di\A); 
(8) O-(, A) = —(i, A); 
(h) Olli, A1) + +++ + (ir, Ar)) Zi ~(i, Di\A1) + «++ » (in, DiMA) 
whenever th,...,i, are pairwise distinct; 
(i) Oli, Ar) + +++ + (i, Ar)) Zi —(i, D\(A U +++ U An); 
(J) ®(—(i, Ai) +++ + — (ik, And) Zi — (i, Ar) oe + — (ir, Ar) 
for arbitrary (not necessarily distinct) in ..., in. 


PROOF. (a) through (e) correspond to some special cases of axiom (viii). For 
instance, 


d-i A) = Oli A) + (i, D) z (i, DAA). 
(f) through (j) can be derived from (a) through (e). For instance, 


O((h, A1)+ +++ + (ix, Ak)) Zi -O-—( (i Ar) +++ + (ik, Any) 
Zi (~(i, Ai) + +++ + — (ix, An) Zi (Ch, Di\A1) + +++ + (ing Di,\Ax)) 
=i ~(h, Di,\A1)+ +++ + — (ir, Di MAk). o 


Now we are prepared to describe a method of determining the value of any term 
in any system. The main tool in our approach is a certain normal form for terms. 


Definition 4.2 (i) A term is weakly coprimitive if it is of the form 


k m 
X (nan + > (ip, Bi), (10) 
p=l q=1 


where the attributes i, .. . , i, are pairwise distinct. 
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(ii) A term is in weak multiplicative normal form (WMNF) if it is of the form 


II le, 


where all z,’s are weakly coprimitive. 
Notice that any term in WMNF is simple. If we had axioms 


~(i, A) = (i, DAA), 
(i A) + (i, B) = (i, AUB), 


then we would be able to transform (10) into 

k m 
5 (ip (Di,\Ap) U U B8). 
p=) g=i 


However, as we have already seen, these axioms are not valid under the internal 
interpretation. 


Lemma 4.3. For any simple term t there is a term s in WMNE such that t =rp S. 


Proor. We shall describe an effective algorithm for transforming f into an 
internally equivalent term s in WMNF. Using the axioms of Boolean algebra, we . 
transform ¢ into a product of sums, each sum consisting of some number of descriptors 
or negations of descriptors. Each of these sums can be transformed into the form 
where descriptors are grouped according to attributes, say 


k Mp np 
X (3 —(ip, Ap) + b} (ip, Ba) (11) 
p=] \r=1 q=1 
(i, ..-, ią pairwise distinct). By de Morgan’s Law and axiom (vii) of group TB2, 


m f Mp Mp 

2 —(ip, Ap) ZTB -JĮ (ip, Ap) STB ~(ip, n Ap). 

re r=] pe 

This enables us to transform (11) into a weakly coprimitive term, and consequently 
to transform the whole term into WMNF. 


THEOREM 4.2. For any term t there is a term s in WMNF such that t ~re S. 


Proof. First we describe how to eliminate [C] from ¢. If t contains a C, then ¢ must 
contain a subterm of the form Elp with no E) occurring in p. By the previous lemma, 
p can be transformed into a term in WMNF, say p’. But axiom (viii) enables us to 
directly transform Op’ into a term in WMNF. In this way the number of occurrences 
of FJ is decreased by one. By repeating the above procedure we ultimately eliminate 
C from 1. Now it suffices to transform the resulting simple term into WMNF, which 
is possible by the previous lemma. O 


Notice that the proof of Theorem 4.2 provides an effective algorithm for computing 
the value of any term in an arbitrary system. Indeed, in order to evaluate ||z|| for a 
term ¢ not containing CJ, we can directly apply Definition 3.1(i)-(vi). For general 
terms, we give the following convenient formulation of our method of determining 
the internal interpretation. 


THEOREM 4.3. For any term t and any system Fx E |t\|x if and only if after 
transforming t into WMNF, for every factor 
k np 
2 (-« Ap) + > (ip, Bs) (12) 
q=1 


p=l 
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of this WMNF there is an attribute i, such that 
Bi (x) È Ap or forsomeq, P(x) G Bp. 


Proor. The proof follows directly from Definition 3.1 and from the fact that our 
process of transforming into WMNF preserves internal equivalence. O 


Example 4.1. Lett be the following term: 


— (AGE < 40) -E((SAL > 10000) -~(SAL > 20000) (13) 
+ (SEX = F) + (SEX = M)). 


This query asks for objects with the value of AGE not known to be less than 40 (it 
may be known to be 240), which not only now but also in every possible extension 
of our present knowledge have the following (internal) property: Either the value of 
SAL is known to be greater than 10000 and is not known to be greater than 20000 
(it may be known to be =20000), or the value of SEX is known. We transform t into 
WMNF: 


t z; (AGE < 40) -EX(((SAL < 10000) + (SEX = F) + (SEX = M)) 
-(— (SAL > 20000) + (SEX = F) + (SEX = M))) 
=; — (AGE < 40) -E (SAL > 10000) + (SEX = F) + (SEX = M)) 
-EX— (SAL > 20000) + (SEX = F) + (SEX = M)) 
=; ~(AGE < 40)-((SAL > 10000) + (SEX = F) + (SEX = M)) 
-((SAL = 20000) + (SEX = F) + (SEX = M)). 


(It is easy to see that in this particular case ¢ can be further transformed into 
—(AGE < 40)-((SAL IN (10000, 20000]) + (SEX = F) + (SEX = M)).) Now 
consider the following system: 


object AGE SAL SEX 
x) (0, œ) [0, œ) {M} 
Xe {30} {20000} {F} 
X3 (20, œ) (15000, 30000) {F, M} 
X4 (35, 45) {15000} {F, M} 


Using Theorem 4.3, we obtain the value of (13): 
lell = (2, x4}. 


Note that in some cases it is useful to exploit the fact that 
zrs —( A) + (i ANB) (14) 


for transforming every factor (12) of a WMNF into 
k Np 
x (-« Ap) + Y (ip APN Bs). 
q=l 


p=l 


In this way we can eliminate from (12) the summands (ip, Bf) with Ap N Bg = Ø. 
We can use the technique of transforming into WMNF to prove the following 
theorem. 


THEOREM 4.4 (COMPLETENESS OF THE AXIOM SYSTEM TB). For any terms t, s, 


ts implies = t =p s. 
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Proof. Assume that ¢ =; s, and transform (—t¢ + s)-(—s + t) into a term r in 
WMNF, r =ra (—t + 5)-(~s + f). We shall prove that r =rp 1. Indeed, otherwise r 
would contain a factor r;, say 

k n 
È (-( Az) + Y (ip an), 
gq=i 


p=1 
with A, € Bf for 1 spk, 15 qs np. Notice that if A, C Bf, then, by (14), 
~(ip, Ap) + (ip, BE) Srp —(ip, Ap) + (lip Ap N BZ) 
=p — (ip, Ap) + (ip, Ap) rB l. 


Consider a system Z with B:(x) = D: for all i E J, x E€ X, and with a nonempty set 
X of objects. By axiom (viii) we get 


k n 
XL (ip, (Di,\Ap) U Be) 


p=! q=l 


|Srill = 


s 
k np 

= U U || (i, (DiMA) U BP) Ily 
p=lq=1 


k np 
=U U =ð, 
p=1 q=l 
since Ap È BJ implies (D;,\Ap) U BZ # D; Let us fix an object x € X. By the 
definition of |r: + (see Definition 3.1(vii)), there is a system S’ = S such that 
x & |lril]~, and consequently x Æ iirhs (Urle C lirie). On the other hand, since 
t =; s, we have [cll] = sl and 
Ire = [t+ s) s + Olly 
= (X\Itls) U Iso) O CAs) U Hell) 
=XNX=xX, 
that is, x € |r|]. This contradiction shows that r ~rs | must hold. Hence we have 
t rp ter Srg t-(—t + s) (—s + t) Sre t-(-t + 5) 
Srp t- (t + s) (~t + s) Srp t(s + (¢-—-t)) Sre t-s. 
Similarly s =rs t-s, and consequently t rg s. O 

Combining this result with Theorem 4.1 we see that the relations =; and ~rs 
coincide. 

In some cases it may be convenient to use another axiom system based on © as a 
primitive operation (CJ can be expressed as —©—). The reader may easily verify that 
a (dual) complete axiom system TB* can be obtained from TB by replacing axioms 
(i)-(iv) and (viii) by 

G)* Olt +s) = Ot+ Os, 


Gi)* 6-Or=4, 
(iii)* OOr= OF, 
(iv)* ©0=0, 
k Np k Np 
(viii)* eJ] G Ap); II —<ip, 23)) = J] I] ~(i» (Di,\4,) U BP) 
p=) q=1i p~l gal 


(with the same restrictions as in (viii)). 


Using these axioms, we can transform any term into weak additive normal form 
(WANF), that is, a sum of weakly primitive terms of the form 


h Np 
H (a-i — (ip, Bs) 
p=} g=l 
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(4, ..., İp pairwise distinct). Computing the internal interpretation is then carried oy! 
analogously as for WMNF. A 

Now we briefly discuss the internal interpretation of formulas. We do not give any: 
general method of computing ||® || +; no such method is known to the author. 

A reader who is familiar with the Kripke models for the modal logic S4 (see. 
Kripke [8] and Fitting [3, Ch. 3]) has undoubtedly noticed the similarity between 
Definition 3.1 and the definition of truth value of a formula in a Kripke model. An 
immediate corollary from this similarity is that transforming formulas according to 
the axioms of modal logic S4 preserves internal equivalence. For a discussion of S4 
the reader is referred to [3, 6, 8, 15]. Here we only note that TBAs play in S4 the 
same role as Boolean algebras in the classical logic; that is, an expression is an S4. 
tautology if and only if its value is 1 in every TBA. An important difference between 
Definition 3.1 and a Kripke model is that the latter can be based on an arbitrary 
partial order, while our partial order < (the relation of extension) has some specific 
properties; for example, for any Z there is a maximal element (complete system) 
F’ = F. This has the result that there are formulas which have value T in every 
system, yet which are not (substitutions of formulas into) S4 tautologies. Examples 
of such formulas are 


Oo = OD, 
DO A F) e 008 ADF, 
SDS v $) e OD v ODY, 


(® = Y abbreviates (® = Y) A (Y = ®); see Theorem 3.2 for the intuition connected 
with these formulas). Other universally valid formulas are, in our case, 


Cit = 0) = Or = 0, 
Oe 0) =r 40, 
Ot = 0) = Hr =0, 
O(t #0) = O14 0. 


It is not known to the author whether all these formulas completely axiomatize the 
internal equivalence of formulas (i.e., whether an analog of Theorem 4.4 holds). 
While the problem of a complete axiomatization of internal equivalence, as well as 
that of evaluating ||® ||~ for any ® and Z remain interesting logical open questions, 
it seems that the method of determining the internal interpretation for formulas of a 
special kind which we describe in the next section is quite sufficient for practical 
purposes. 

Let us finally mention that our terms and formulas can be treated as the open and 
closed formulas, respectively, of a certain monadic modal predicate calculus (see 
Lipski [12)). 


5. Internal Interpretation: A Simplified Language 


The internal interpretation of queries described in the preceding section, although 
precisely defined, may be not intuitively clear for a user, especially a casual one. The 
main reason seems to be the fact that the meaning of the operation C] and the 
connective L) is less lucid than the meaning of the (formally more complicated) 
operation []© and connective OO (see Theorem 3.2). A user, who is in most cases 
interested just in deducing as much information about reality as possible from 
incomplete data, is likely to think of the system in terms of all completions of the 
information available in the system, that is, all possibilities of what reality may turn 
out to look like. On the other hand, in the definition of ||EJr|| and |[O®]| (see 
Definition 3.1(vii) and (xiv)) we take into consideration all extensions, not necessarily 
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complete. Each such extension may be thought of as an intermediate stage in a 
hypothetical process of increasing the information contained in the system. What we 
do in the interpretation described in the preceding section is, in a sense, indirectly 
define (internal) properties of objects (or of a system as a whole) by specifying 
roperties of the possible processes of increasing knowledge. It seems that such an 
expressive power may not be necessary in a query language. With this in mind, we 
now propose a certain subclass of queries as a basis for the query language. 
Let us denote Ort by surely t, for any term £t. The set A of special terms is defined 
to be the least set 7’ with the following two properties: 


(i) 0, 1, and every descriptor are in 7’, 
(ii) —t, surely t, (t + 5), (t-s), and (t — s) are in 7’ whenever t, s E€ 7". 


If we denote —surely —t by possibly t, then 
possibly t =; -D-t =; -G-A =; ©, 
and by Theorem 3.2, for any term ¢ we have 
surely te= A{|ltlv:S’ is a completion of S}, (15) 
|| possibly tls = U {tlle : Z is a completion of P}. (16) 


Similarly, by introducing the abbreviations Surely ® for 0O® and Possibly ® for 
“Surely ®, we define the set A of special formulas to be the least set F’ with the 
following two properties: 


(i) T, F, and every special atomic formula t = s (t, s E J) are in F”. 
(ii) 7, Surely ®, (© V Y), (© A Y), and (® => Y) are in F’ whenever 


Q, FEF. 
As before, Theorem 3.2 implies 
|| Surely P|], = inf{|]®]] +: is a completion of 7}, (17) 
|| Possibly ® || >= sup{||®]].»:Y is a completion of P}. (18) 


The following lemma gives some useful properties of surely. 


Lemma 5.1 
(a) surely (i, A) =i (i, A); 
(b) surely —(i, A) Zi (i, D\A); 
(c) surely (t-s) =; surelyt-surelys; 
(d) ift zes, then 
surely t Z; surely s. 
PROOF 
(a) By Lemma 4.2(b), 
surely (i, A) = O-Ol-(i, A) = O—(i, Di\A) 
= (i, D\(Di\A)) zi (i, Di). 
(b) We can prove (b) similarly, using Lemma 4.2(a) and (b). 
(c) By (15), in any system Z we have 
|| surely (t-s) = N {llr s]: is a completion of } 
= N {llle N isll: is a completion of S} 
= Nille: is a completion of S} 
NN{sllv:H' is a completion of S} 
= |[surely tl N || surely s|lyv 
= ||surely t-surely s|. 
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(d) t Ze s means that ||t||-- = |] s|]-- for any complete system ¥’. Hence, by (15), 
|| surely tl] = N{||t|]y-:H’ is a completion of S} 
= N (liss: is a completion of S} 
= ||surely sls; 
that is, surely t =; surelys. O 


By virtue of Lemma 5.1(d), if an occurrence of surely is within the scope of another 
occurrence, then the former can simply be deleted. Using this rule, we can easily 
transform any special term to a Boolean combination of descriptors and terms of the 
form surely s, s not containing surely. (Moreover, any reasonable term entering a 
database is, most probably, already of this form.) 

In order to eliminate surely from the resulting term efficiently we need another 
lemma. First, however, we give some auxiliary definitions. 

A term is said to be in additive normal form (ANF) if it is a sum of primitive terms 
of the form [] jeu (Ñj, 4;), where ip ¥ i, for p # q, and Ø # A; # D; for all jeJ, 
Similarly, a term is in multiplicative normal form (MNF) if it is a product of coprimitive 
terms of the form je, (i;, Aj), where, as before, all the attributes i; are different and 
none of the descriptors reduces to 0 or 1. Of course, ANF and MNF are special cases 
of WANF and WMNF, respectively. 


Lemma 5.2. If sis in MNF, then 
surely $ =; s. 
Proor. Let s be of the form [];¥(i;, A;). Then 
surely s =; [] surely È (i;, A;) (by Lemma 5.1(c)) 

Zi I o-5-5 (i;, A;) (by definition of surely) 

zi I B-E -6 A;) (by de Morgan’s Law) 

zi I BT EKG, Aj) (by Lemma 5.1(c)) 

x I oT (i, Di\A;) (by Lemma 4.2(b)) 

Zi 1 im) =n D;\A;) (by de Morgan’s Law) 

= I 2 (i, Aj) . (by Lemma 4.2(e)) 

=; E i o 
Now we are ready to summarize the simplified method of computing the value of 

an arbitrary special term £. 

Step 1. Suppress nested occurrences of surely in ¢. 


Step 2. Replace every subterm surely s by a term in MNF externally equivalent 
to s. (The process of transforming any simple term into MNF by using axioms in B 
is quite standard and the details are left to the reader; see also [11].) 


Step 3. Transform the resulting term, which does not contain surely, into WMNF 
(see the proof of Lemma 4.3). 


Step 4. Determine the value of the resulting WMNF term by using Theo- 
rem 4.3. 
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It is not difficult to give a complete set of axioms (involving—unlike TB—only 
specjal terms) for the internal interpretation of special terms. 
Axioms for Special Terms Under Internal Interpretation. The set S of axioms 
consists of 
S1. Substitutions of special terms into the axioms of Boolean algebra and the 
axioms: 
(i) surely (t-s) = surely t- surely s, 
(ii) surely surely t = surely t, 
(iii) surely 0 = 0, 
(iv) surely 1 =1. 
S2. The following axioms concerning descriptors: 
(v) (Ø) =0, 
(vi) (i D) = 1, 
(vii) (4A)-(4 B)= (i ANB) forall E7, A, BE &,. 


S3. The axiom 


k Np k Np 
(viii) surely F (-« Ap) + Y (ip, ap) = 5 (ip, (D;\4p)U U BZ) 
p=l q=1 p=1 q=l1 


for every positive integer k, every sequence of positive integers m, ..., ma, 
every sequence of distinct attributes i, ..., iz, and all Ap, B}, ..., B? € Bi,» 
lspsk. 


The completeness of S can be proved in the same way as the completeness of TB; 
what is essential is that S enables us to transform any special term into WMNF. We 
leave the proof to the reader. 

As in the case of general terms, it may sometimes be convenient to use a dual 
axiom system, which is based on possibly as a primitive operation. The reader may 
easily verify that such an axiom system S* can be obtained from S by replacing 
axioms (i)-(iv) and (viii) by 


(i)* possibly (t + s) = possibly t + possibly s, 
(ii)* possibly possibly t = possibly t, 
(iii)* possibly 0 = 0, 


(iv)* possibly l= l, : 


np np 
(viii)* possibly Ī[] | (ip, Ap) - T] —<ip, z) ) = J] -(ip, (D; \4p) U U Bf). 
p=) q=1 p=1 q= 
The above axiom system is especially useful when we transform special terms into 
weak additive normal form. 


Example 5.1. Let us consider the following special term ¢: 


possibly ((DEPT# = 4) . (NAME = BROWN) 
+ surely ((NAME # LIPSKI) -(DEPT#¥ = 1))) 
-surely ((SAL < 15000) + (#CHILDREN > 3) 
- (SAL IN (10000, 20000)) - (STATUS = MARRIED)). 


We show below the process of transforming ¢ into WMNF. (We shall strictly follow 
the general method of transforming into WMNF described in this section, though in 
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our particular example there are places where the transformation can be done more 
efficiently.) 


t z; —surely— ((DEPT#¥ = 4). (NAME = BROWN) 
+ (NAME # LIPSKI) - (DEPT# = 1)) 
-surely (((SAL < 15000) + (#CHILDREN > 3)) 
*((SAL < 15000) + (SAL IN (10000, 20000))) 
*((SAL < 15000) + (STATUS = MARRIED))) 
=; —(((DEPT# # 4) + (NAME #¥ BROWN)) 
*((NAME = LIPSKI) + (DEPT#¥ # 1))) 
-((SAL < 15000) + (#CHILDREN > 3))-(SAL < 20000) 
*((SAL < 15000) + (STATUS = MARRIED)) 
=; (—(DEPT# # 4) .—(NAME # BROWN) 
+ —(NAME = LIPSKI) -—(DEPT# # 1)) 
*((SAL < 15000) + (#CHILDREN > 3))- (SAL < 20000) 
-((SAL < 15000) + (STATUS = MARRIED)) 
=; (—(DEPT# # 4) + —(NAME = LIPSK]I))-—(DEPT# IN (2, 3, 5)) 
-—(NAME = LIPSKI) .(- (NAME # BROWN) + —(DEPT# # 1)) 
-((SAL < 15000) + (#CHILDREN > 3))- (SAL < 20000) 
*((SAL < 15000) + (STATUS = MARRIED)). 


(In the transformation process we assume that Dprprw = {1, 2, 3, 4, 5}.) 


6. Computing the Value of Special Formulas 


In this section we develop a method for determining ||®|| for any special formula 
®. It is interesting that this method has a combinatorial flavor and is quite different 
from that of computing the value of a special term. The main idea of our method can 
be explained using the following simple example. 


Example 6.1. Suppose that three objects x, y, z are classified with respect to 
color. Assume that the color of no object is known, and consider the following two 
Situations: 


I II 
possibly green objects A, = {x, y} B, = {x, y} 
possibly red objects Ag = {x} Bo = {x} 
possibly blue objects Az = {x, y} B; = {x, z} 


(We do not exclude the possibility that an object is of another color than those listed 
above.) We may ask the following question: “Is it possible that all colors, that is, 
green, red and blue, are represented in our collection?” More formally, we ask for 
the value of the formula 


Possibly (( green # 0) A (red # 0) A (blue ¥ 0)). (19) 


It is easy to see that the answer for our question is “no” in case I and “yes” in case 
II. Indeed, in case I we have only the two objects x, y to represent three colors, while 
in case II it may be that x is red, y is green, and z is blue. In order to put this 
observation into more general terms, we need the following definition. A sequence 
Fi, - . - , Tn is Said to be a system of distinct representatives (SDR) of a sequence of sets 
Si,..., Sn if r: € S; for 1 sis n and ri # r for 1 si < j= n. Coming back to our 
example, we see that the relevant difference between case I and case II is that there 
is no SDR for A), A2, As, while there is one for Bı, B2, Bs, namely, x, y, z. This SDR 
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rovides an example of a possible completion of our information concerning the 
objects which makes formula (19) true. We may say that in a general situation of this 
type involving any number of sets Si, ..., Sn corresponding to some mutually 
exclusive properties—call them “colors,” the element r; of an SDR of Sı, ..., Sn 
lays the role of an object which “turns out to be of ith color.” 
The classical combinatorial theorem of Hall [4] asserts that an SDR for Si,..., Sn 
exists if and only if 


U S| ={J| for every JG {1,..., n}. 
jEJ 


This condition is not very interesting from the algorithmic point of view, but efficient 
methods of testing for the existence of an SDR do exist. The best known algorithm 
is given by Hopcroft and Karp [5] in an equivalent formulation in terms of matchings 
in bipartite graphs. It may be useful to describe briefly this alternative formulation 
of the problem. Let Si,..., Sn be subsets of X = {x1,..., Xm}. We construct a graph 
G with vertices corresponding to S;,..., Sn, X1,..-, Xm, With an edge {S;, x;} joining 
S, and x;, for all S;, x; such that x; E S;. By a matching we mean any set of edges with 
the property that no two edges in this set are incident to a common vertex. It is clear 
that n, ..., Fn is an SDR of Si, ..., Sa if and only if (Sj, rı}, ..., (Sn, rn} isa 
matching in G. 


Remark. The algorithm of Hopcroft and Karp constructs an SDR, while we are 
merely interested in its existence; it would be interesting to know whether testing for 
the existence of an SDR is strictly easier than constructing one. 


Now we show how to decompose the problem of computing the value of an 
arbitrary special formula into some number of problems of the type described in 
Example 6.1. To this end we need some definitions and lemmas. 

LemMa 6.1. For any formulas ®, Y, 


(a) Possibly (® V Y) =; Possibly © V Possibly ¥; 

(b) if ® =, F, then Possibly ® z; Possibly Y. 

Proof. The proof is analogous to the proof of Lemma 5.1(c) and (d). O 

We may replace every occurrence of Surely by —Possibly—, and then, by Lemma 
6.1(b), suppress all those occurrences of Possibly, surely, and possibly which are within 
the scope of Possibly. In this way we obtain a logical combination of special atomic 
formulas ¢ = s (t, s E A) and formulas Possibly ®, where ® is simple (i.e., it does not 
contain Surely, Possibly, surely, or possibly). The value of any special atomic formula 
can easily be computed by the methods developed in the previous section; it is 
convenient to make use of the fact that 

ne T if jts + s-t] = ©, 
F otherwise. 

Determining || Possibly Y ||, ¥ simple, is related to Example 6.1 and is much more 
difficult. We devote the rest of this section to showing how this can be done. 

Definition 6.1 


Gi) A formula is elementary if it is of the form 
(t = 0) A had (t ¥ 0), (20) 


where n = 0, to is in MNF, h, ..., fn are in ANF, and f)-t, Se 0 forO sp < 
gan. 
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(ii) A formula is in special disjunctive normal form (SDNF) if it is of the form 
NY, 
leL 
where all ¥?s are elementary. 


We treat formulas (20) lacking the atomic formula zo = 0 also as elementary; to be 
more formal, we could add in such a case a dummy atomic formula 0 = 0. j 
We prove that every simple formula can be transformed into an externally 
equivalent formula in SDNF. Let us first notice that for any simple formula ® there 
exists a finite collection of terms sı, ..., sp (p depends on ®) such that s;-5; =. 0 for 
1<i</j<p, and every term occurring in ® is externally equivalent to a sum of some 
number of terms s;. We shall call such a collection a set of atoms with respect to ®, 


One way to obtain sı, ..., Sp is the following. Let J be the set of attributes represented 
in ®. For any j € J, let i 
A;(®) = {4: (j, A) occurs in ©}, (21) 
€(®) = the set of all nonempty constituents of 4 (®). (22) 


(Recall that a constituent of a family {A;,..., An} of subsets of a set X is any set of 
the form Af! N --- MA where 6 ..., € E (0, 1}, and Af denotes A; if «i = 1 and 
X\A; if «i = 0 (see Kuratowski and Mostowski [10, p. 21]}).) In other words, 6; (®) is 
the set of atoms of the Boolean algebra of subsets of Dj generated by .o% (®). Now let 
us define 


AÖ) = fu (j, Aj): for every jE J, A; € sw). 
JE 


It is easy to see that ..(®) is a set of atoms with respect to ®. Sometimes it is 
convenient to consider instead of @(®) a partition of D; which is finer than the 
partition into nonempty constituents of .(®). For instance, for a real-valued attribute 
it may be useful to consider a partition into disjoint intervals, even if not all 
constituents are intervals. For simplicity, we denote such a finer partition by @(®), 
too. 


THEOREM 6.1. For every simple formula ® there is a formula ¥ in SDNF such that 
® =, Y. 


Proof. We describe an effective procedure for transforming an arbitrary simple 
formula ® into SDNF. 


Step 1. Replace every atomic formula ¢ = s occurring in ® by the externally 
equivalent formula t-—s + s.-—t = 0. 


Step 2. Using the propositional calculus axioms, transform the resulting formula 
into a disjunctive normal form W;®,, where every ®; is of the form 


A (uz=0)A A (ue ¥ 0). (23) 
keEK- kekr 


Now it is sufficient to show how to transform each such conjunction into SDNF. 


Step 3. Using the equivalence 


/A (m= 0) =( > u) =0, 
kEK- kEK- 


: On Databases with Incomplete Information 63 


lace the first part of (23) by the atomic formula to = 0, where fo is the result of 
ai nsforming Drex- u into MNF (for transforming into MNF, see [13]). 


Step 4. Replace the resulting formula, that is, 
(t= 0)A M (ux ¥ 0), (24) 
kEK* 


by the (externally equivalent!) formula 
(t0 =0)A MM (ur —to * 0), 
REK? 


and then by 
(to=0)A M (ve #0), (25) 
keK+ 


where vx is the result of transforming uz. —t into ANF. 
Step 5. Let O be the second part of (25), that is, 
A (vk Æ 0), 
kEK- 


and let J be the set of attributes represented in O. For every j € J, determine ¢(0) 
(see (21), (22)). Since every vx is in ANF, it is a sum of primitive terms of the form 
[Ler (j, 4;), P € J. Transform each such primitive term into J] jez (j, 4;) by adding 
factors (j, D;) (=el) for every j E J\P. Now replace every factor (j, Aj) by the 
externally equivalent sum 
(i, C) 
(CCA) A(CEG(O)) 

(notice that A; = U{C E €({©):C C A,} and that we have axiom (ii) of group B2), 
and then, using the distributive law, transform the resulting product of sums into a 
sum of atoms with respect to O. By applying the above transformation to every 
primitive term in v; and then suppressing repeated summands, we transform vy; into 
a sum of distinct atoms with respect to ©, say Dsem, Se. 


Step 6. Using the equivalence 
( by a) 40, W (sz # 0), 
kEM, REM; 
transform (25) into 
(t =0)A MA W (s #0). 
JEK* REM; 


Applying the (logical) distributive law and then suppressing repeated formulas 
5s Æ 0 within every conjunction, we ultimately arrive at a disjunction of elementary 
formulas, that is, SDNF. (It may then be useful to suppress repeated elementary 
formulas.) © 


Example 6.2. Let ® be the formula 
((SEX = F) # (AGE = 25)) 
A [((AGE < 30) - (SAL > 30000) = 0) A ((SEX = M) = 0) 


V (AGE 2 25). (SAL < 15000) - (SEX = F) £0) 
A (SEX = M} (SAL = 15000) # 0)]. 26) 
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Below we show some of the stages of transforming ® into SDNF: 


® =, ((SEX = F). (AGE < 25) + (SEX = M)-(AGE = 25) # 0) 
A (AGE < 30) - (SAL > 30000) = 0) A ((SEX = M) = 0) 
V ((SEX = F). (AGE < 25) + (SEX = M)-(AGE = 25) #0) 
A ((AGE = 25). (SAL < 15000). (SEX = F) # 0) 
A ((SEX = M)+(SAL = 15000) # 0) 
Re (((AGE < 30) + (SEX = M))-((SAL > 30000) + (SEX = M)) = 0) 
A (SEX = F) «(AGE < 25) - (AGE = 30)-(SEX = F) 
+ (SEX = F)-(AGE < 25). (SAL < 30000) - (SEX = F) 
+ (SEX = M). (AGE = 25)-(AGE = 30)- (SEX = F) 
+ (SEX = M). (AGE = 25) . (SAL < 30000) - (SEX = F)) ¥ 0) 
V ((SEX = F}. (AGE < 25) ¥ 0) 
A ((AGE 2 25). (SAL < 15000) - (SEX = F) # 0) 
A ((SEX = M). (SAL = 15000) ¥ 0) 
V ((SEX = M) - (AGE = 25) # 0) 
A ((AGE = 25). (SAL < 15000). (SEX = F) ¥ 0) 
A ((SEX = M). (SAL = 15000) ¥ 0). 


The first two parts of the above disjunction are already elementary formulas (strictly 
speaking, the first one becomes an elementary formula after some simple reductions). 
Let © be the third part. We have 
EacelO) = {(0, 25), [25, œ)}, 
€sar(@) = {[0, 15000), [15000, -)}, 
Gsex(O) = ({F}, {M}}, 
O =. ((SEX = M)- (AGE > 25)-((SAL < 15000) + (SAL = 15000)) # 0) 
A ((AGE = 25)-(SAL < 15000) - (SEX = F) # 0) 
A ( (SEX = M). (SAL = 15000) 
-( (AGE < 25)+ (AGE = 25)) # 0). 


After applying the distributive law and performing some simple reductions, we 
finally obtain the SDNF of ®: 


(((AGE < 30) + (SEX = M))- ((SAL > 30000) + (SEX = M)) = 0) 
A ((SEX = F) . (AGE < 25) « (SAL < 30000) # 0) 
V ((SEX = F) - (AGE < 25) #0) 
A (AGE = 25) . (SAL < 15000) - (SEX = F) # 0) 
A ((SEX = M) - (SAL = 15000) # 0) 
V ((SEX = M) - (AGE = 25) - (SAL < 15000) # 0) 
A (AGE = 25) « (SAL < 15000) - (SEX = F) ¥ 0) 
. A ((SEX = M) - (SAL = 15000) - (AGE < 25) # 0) 
V ((SEX = M) - (AGE = 25) - (SAL < 15000) # 0) 
A ((AGE = 25) « (SAL < 15000) - (SEX = F) ¥ 0) 
A ((SEX = M) - (SAL = 15000) - (AGE = 25) # 0) 
V ((SEX = M) - (AGE = 25) - (SAL = 15000) # 0) 
A ((AGE = 25) - (SAL < 15000) - (SEX = F) ¥ 0) 
A ((SEX = M) - (SAL = 15000) - (AGE < 25) # 0) 
V ((SEX = M) - (AGE = 25) - (SAL = 15000) # 0) 
A ((AGE = 25) - (SAL < 15000) - (SEX = F} # 0). (27) 


Notice that in order to be more efficient, we did not exactly follow the general pattern 
of transformation described in the proof of Theorem 6.1. 


All we need now is a method for computing the value of Possibly ®, ® elementary. 
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Indeed, let us transform a simple formula ¥ into an externally equivalent formula in 
SDNF, say Wier, where the Y; are elementary. By Lemma 6.1, 


Possibly Y =; Possibly WY, =; W Possibly Vi. 
lek leL 


The last step is provided by the next theorem. . 
THEOREM 6.2. Let ® be an elementary formula of the form 
(to = 0) A m (te ¥ 0). (28) 


Then || Possibly ® ||- = T if and only if the following two conditions are satisfied: 


(i) || surely toy = Ø; 
(ii) the sequence || possibly tils, ..., || possibly tn||y has an SDR. 


ProoF. If || Possibly ®||+= T, then by (18) there is a completion Y’ => Z such that 
| @ll- = T; that is, |] folly. = Ø and | telly = Ø, 1 = k <n. By (15) it follows that 
|| surely tol- = Ø, and since t+ tg =. 0 for 1 £ p <q £n, the sets |ti ls... Itale 
are mutually disjoint, so that we may choose distinct elements x; E |[hfly, ..., 
Xn E || fall’. But (16) implies | tz l+; C |] possibly tg |v; hence xı, . . . , Xn is an SDR of 
the sequence || possibly tilly, ..., || possibly tn |] 

Conversely, suppose that || surely fo||~ = Ø and that x1,..., xn is an SDR of the 
sequence || possibly tilly,..., || possibly trl. Then by (15) and (16) there exist 
completions R, A, ..., AM such that || loll, = 0, xx E Iteli 1 = k = n. Let (Bier 
0=k =n, correspond to 4, 0 < k < n, respectively. We define a completion 2” by 

pos fe if x = Xk forsome k, ISksn, 
Bi(x) otherwise. 


It is easy to see that xz E || tello, 1 sk £ n, and | lly. = Ø (xe E lolly: since x: E 
\ltell-- and to- tk Ze 0). Consequently, || ®||. = T, that is, || Possibly © |y= T. O 


Notice that in the proof we did not make use of the fact that to was in MNF and 
that the fx, | = k <n, were in ANF. In fact, these assumptions in the definition of an 
elementary formula (see Definition 6.1(i)) were made only to make computing the 
value easier: if fo is in MNF, then by Lemma 5.2, 


Il surely tol] = || toll; (29) 
similarly, if f» is in ANF, say } p [Tq (ipg, Apg), then 


Il possibly tel] = 


—surely TT X (ings Di, \Apa? 
2P q 


z | -I È (ipa, Di, \Apa) 
Pa 


: (30) 


z H = (ipg Di,,\Apa) 
P a 


Example 6.3. Consider a very simple system represented by the following table: 


object AGE SAL SEX 
x) (20, 40) [0, œ) {M} 
X2 {35} [30000, 40000) {F, M} 
Xs (30, 40) [20000, 30000] {F} 


Xa [20, 30) (10000, 20000) {F} 
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We shall compute || Possibly ||, where ® is given by (26). First we transform ® 
into SDNF, which yields formula (27). This formula is a disjunction of six element 
formulas, ®, V --+ V ®e, where ©; is of the form (tio = 0) A ANELi(tiz ¥ 0) (tio may be 
0). Using (29), (30), and Theorem 4.3, we obtain 


|| Possibly |] = F, since surely hol] = {xı} # Ø, 
|| Possibly ®2|| = F, since no SDR exists for 
|| possibly tar|| = {x4}, || possibly tə2|| = {x4}, || possibly teal = (21, x2), 
|| Possibly B3|] = F, since no SDR exists for 
| possibly trl = {x1}, || possibly tel] = (xa), || possibly tall = {xı}, 
|| Possibly B,|| = T, since xı, x4, x3 is an SDR for 
| possibly tal] = {x1}, || possibly tall = {x4}, || possibly taal] = (xy x2). 


Consequently, || Possibly ®|| = T (we need not consider ®; and ®g). l 


It is easy to see that the length of the SDNF may, in general, grow exponentially 
when the length of a formula gets large. Notice, however, that any straightforward 
method of evaluating || Possibly ®||- based on enumerating all completions of ¥ 
would be incomparably worse, since the number of such completions is, in general, 
an exponential function of the number of objects. (Strictly speaking, the number of 
completions may be infinite when an attribute domain is infinite. However, we do 
not need the exact value of an attribute j in a completion; it is sufficient to know the 
subset C E @(®) to which it belongs.) For obvious reasons, our simple example 
could not illustrate the fact that in a real database the number of objects is usually 
several orders of magnitude bigger than the length of a query. 

Some ways of improving the efficiency of our method of evaluating || Possibly © || 
are listed below: 


(1) In order to transform (25) into SDNF we may repeatedly apply the equivalence 
(v # 0) A (vı ¥ 0) Ze (vk. vı Æ 0) V ((vk Æ 0) A (ve + -v #Æ 0)) 


and the distributive law to formulas (v, # 0), (vı # 0) such that v,- v: že 0. The 
resulting SDNF contains, in general, fewer elementary formulas. In particular, we 
skip Steps 5 and 6 whenever (25) is already an elementary formula. 

(2) We need not transform (24) into SDNF if we find that || surely to|| Æ Ø (the 
value of || Possibly ¥ ||, where ¥ is given by (24), is then F). After having generated 
an elementary formula ®, we may evaluate || Possibly ®,||, and if it is T, we need not 
continue the transformation process (the value of the whole formula Possibly ® is 
then T). 


(3) Testing for the existence of an SDR can be simplified by using the following 
simple combinatorial facts: 


(a) If Se = Ø for some k, then no SDR exists for S),..., Sp. 
(b) If Sk\(S) U <+. U Sk-1 U Spe) U oes U Sn) Æ Ø, then 


Si,..., Sn has an SDR © Si, ..., Sp}, Skan ..-, Sa has an SDR. 
(c) If | S,| = n, then 
Si,..., Sn has an SDR © Si, ..., Spi, Ska o., Sn has an SDR. 


(a) corresponds to || possibly t; || = Ø, and (b) to || surely t; ||. Ø, since tp - ti Ze 0 
implies 


|| possibly t |N|] possibly ti|| D || surely tr |N] —surely ~t; | 
= ||surely ti|| N ||surely—ti]| = || surely tr - ~til = || surely || ¥ Ø. 
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(4) If fo in 0 is an elementary formula ¥ and || Possibly Y || = T for a subset of our 
collection of objects, then || Possibly Y|| = T for the whole collection. It is not 
necessarily so when to is not 0; then the existence of an SDR for || possibly tl, .. -, 
|| possibly t,|| is preserved under adding new objects, but the condition || surely toll = 
@ may be violated. 


By the last remark, we may infer that the value of Possibly ®, where ® given by 
(26), is Tin any system which—when represented by a table—contains the four rows 
from Example 6.3. 


Example 6.4. Let ® be the formula 


(((HAIR = FAIR) - possibly (SEX = F)) = 0) 
A Surely ((SAL > 50000) » (TAX < 5000) = 0). 


The first part of this formula can be transformed into 
(HAIR = FAIR) - ~(SEX = M) = 0 


(the left-hand side is in both WANF and WMNF, which enables us to easily compute 
its value in any system). In order to evaluate the second part of our formula, we 
might exploit the general method based on transforming into SDNF. It is, however, 
not necessary in our simple example (neither is it necessary in most of the queries 
likely to arise in practice). Instead, we may use the first of the following four 
equivalencies: 


Surely (t = 0) =; (possibly t = 0), 

Surely (t #0) =; (surely t ¥ 0), 
Possibly (t = 0) ~; (surely t = 0), 
Possibly (t # 0) =; (possibly t # 0), 


(t is an arbitrary special term; the easy proof of these equivalencies is left to the 
reader). We have 


possibly ((SAL > 50000) - (TAX < 5000)) =; — (SAL = 50000) - — (TAX = 5000) 
(see axiom (vili)*), and consequently our formula is transformed into the form 


((HAIR = FAIR) - —(SEX = M) = 0) 
A (—(SAL < 50000) . - (TAX = 5000) = 0), 


which easily lends itself to evaluation in any system. 


In the above example we have eliminated Surely from our formula. This is, 
however, not always possible. The reason for this impossibility is, very roughly 
speaking, the fact that the existence of an SDR is not a “Boolean” property; there 
may be two sequences, S3, ..., Sn of subsets of X and TM), ..., Tha of subsets of Y, 
such that 


SUM +++ NSPHDSETIN---NTTP=OH 


for all gq, ..., €n © {0, 1}, yet Si, ..., Sn has an SDR, while 7), ..., Ta does not 
have any SDR. This is so, for instance, in the case of the sequences {x, y}, {x, y} and 
{x}, {x}. It is easy to see that formulas not containing Surely express “Boolean” 
properties of the values of special terms. On the other hand, we know (see Theorem 
6.2) that there are special formulas of the form Possibly ¥ which express the existence 
of an SDR—a “non-Boolean” property—for the values of some special terms. Of 
course, no such formula can be replaced by any (internally) equivalent formula not 
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containing Surely. For example, let us consider a system with Bsex(x) = {F, M} = 
Dsex for all x E X, and let ® be the formula Possibly (( (SEX = F) # 0) A 
((SEX = M) # 0)). Of course, if |X| = 1, then ||®|| = F, and if |X| = 2, then 
| P| = T. However, the value of any formula not containing Surely is the same in 
both cases. Indeed, the value of any special term is either Ø in both cases or X in 
‘poth cases, and hence the value of any atomic formula is the same in both cases. 

To conclude this section, let us mention that the values || surely t||, || possibly t|ļ, 
|| Surely ®||, and || Possibly ® || (for simple ¢ and ©) were denoted in [13] by liry, ll¢ll*, 
llls and ||®||*, respectively (|| - ||, and ||- ||* were called the lower value and the 
upper value, respectively). Our algorithm for computing || Possibly ®|| gives a method 
for determining ||®||* (and ||®||,, since ||®|],, = || 7®||*), a problem which was left 
open in [13]. 


7. Other Interpretations of Queries 


There us another approach to semantics of queries in an incomplete information 
system, based on the theory of pseudo-Boolean algebras (PBAs) and intuitionistic 
logic. (The relation of PBAs to intuitionistic logic is exactly the same as the relation 
of TBAs to modal logic S4; see [3, 15].) We describe only the pseudo-Boolean 
approach to interpreting terms (intuitionistic interpretation of formulas is considered 
in [11]). In the pseudo-Boolean approach we consider only simple queries, and we 
treat “—” as a “strong” negation, that is, —t is understood, roughly speaking, as the 
set of objects known not to have property t, instead of just not known to have property 
t. Also the interpretation of “—” is “strong.” The formal definition of the “pseudo- 
Boolean value” of a simple term :—denote it by |t|,—-can be obtained by changing 
(iii) and (vi) in Definition 3.1 to i 


|-t|y = {x E X: for every F’ > Fx E ltl), 
|t—> sly = {x E X:for every F’ > Z x Ẹ |t| orx E {sl} 


(and deleting (vii)). It can easily be shown that for any simple term #, 


lels = || 7) lv, 
where 7(#) is defined inductively by 


(i) 7(0)=0, 71) =1, 
(ii) T(t + s) = 7(t) + 7(5), 
(iii) z(t. s) = 7(t)- 7(5), 
(iv) 1(-1) = D-7), 
(v) t> s) =O) > 7(5)). 


It is also not difficult to prove that 


|| surely tls = |-—tly, 
|| possibly tly = X\|—t|y. 


Similar relations between || - || and | - | exist for formulas. These relations reflect the 
fact that the set of open elements of any TBA (a is open if a = Hla) forms a PBA, and 
they are connected with the well-known interpretation of intuitionistic logic within 
the modal logic S4 (see [3, 15]). It may be noted that there exists a similarity between 
a “pseudo-Boolean value” and the Kripke models for the intuitionistic logic (Kripke 
[9]; see also Fitting {3]). It can also be shown that | - |+ coincides with the interpretation 
of terms defined—in a quite different way—by Jaegermann [7]. 
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In the internal interpretation of queries presented in this paper we consider, for 
any incomplete system % the set of all extensions of Z Since J represents an 
incomplete knowledge about a reality described by a completion “* of Z not all of 
these extensions are really possible. The only completions accessible in reality are 
those consistent with *, that is, those of which * is a completion. Hence, we may 
consider a different approach, where the partially ordered set of all extensions of Z 


is replaced by 
o [EF] {SLP Sf"). 
Of course, in general we do not know S *. However, there are formulas, such as 
Hor = OL, OO e O04, 


which are true for any “and for any completion /* of Z (The interpretation of a 
term and of a formula is now the same as in Definition 3.1, but with “for every 
S' =F...” replaced by “for every S’ E [5 ¥*] ....”) These universally valid 
formulas define a logic, quite different from the one inherent in our internal 
interpretation described in Section 4. Notice that this logic describes the process of 
increasing the user’s (system’s) knowledge, as seen by an observer who has complete 
information about both the system (S) and reality (2 *). 


8. Conclusions 


Following a treatment of more elementary topics in [13], we have given a thorough 
treatment of the internal interpretation of queries. It has turned out that this 
interpretation leads in a natural way to the notion of a topological Boolean algebra 
and to a modal logic related to S4. These notions have been shown to play the same 
role for the internal interpretation as Boolean algebras and classical logic in the case 
of external interpretation. We have presented a complete axiom system for internally 
equivalent transformations of terms and a method for computing the internal 
interpretation for arbitrary terms and for a broad class of formulas including most 
formulas of practical interest. 

The number of steps performed by our algorithms may, in the worst case, be an 
exponential function of the length of the query. However, they seem to be of practical 
interest for real world queries that are likely to be submitted to a database. 

There are a number of interesting problems which remain open. One such problem 
is to investigate in more detail the logic involved in the internal interpretation of 
formulas. More specifically, it is not known to the author whether this logic is 
decidable (it may be noted that the results of Section 4 can easily be used to develop 
a decision procedure for atomic formulas). Another question is whether there is a 
simple axiom system for this logic. Both problems are open even for the sublanguage 
considered in Section 5. 


Note added in proof. The logic involved in the internal equivalence of formulas has 
recently been shown to be decidable; see H. Ono and A. Nakamura, Decidability 
results on a query language for data bases with incomplete information, Proc. 9th 
Int. Symp. on Mathematical Foundations of Computer Science, P. Dembinski, Ed., 
Springer-Verlag, Berlin, 1980, pp. 452-459. 
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